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1 Introduction 


Maithili is the traditional writing system for the Maithili language, which is spoken predominantly in the state 
of Bihar in India and in the Narayani and Janakpur zones of Nepal. The script is also known as Mithilaksara 
and Tirahuta, names which refer to Mithila, or Tirhut, as these regions of India and Nepal are historically 
known. Maithili is a Brahmi-based script derived from Gaudi, or ‘Proto-Bengali’, which evolved from the 
Kutila branch of Brahmi by the 10th century св.! It is related to Bengali, Newari, and Oriya, which are also 
descended from Gaudi, and became differentiated from these scripts by the 14th century. It remained the 
dominant writing system in Mithila from that period until the 20th century. 


Maithili is associated with a scholarly tradition that spans six centuries, as attested by manuscripts and in- 
scriptional records. Uniquely Maithili manuscripts consist of the vernacular songs of the poet Vidyapati 
Thakura (14th century) and Sanskrit treatises on Nyaya philosophy; moreover, Maithili is the traditional 
script for Sanskrit in the Mithila region. Inscriptions in Maithili are located in Bihar and Nepal. The Maithil 
Brahmin community has used the script for maintaining райй or genealogical records since the 14th cen- 
tury. Maithili metal fonts were developed in Kolkata (Calcutta) in the 1920s and books were first printed in 
the script at that time. Devanagari replaced Maithili in the 20th century; however, Maithili is still used for 
producing genealogical records, religious texts, and letters. The script is used in signage in north Bihar and 
is permitted as an optional script for writing the Bihar civil services examination. 


The Government of India recognized Maithili as a scheduled language in 2004, a status that ensures official 
support for the development of the language. The official status of Maithili has further revitalized interest in 
the traditional script for the language. In 2005, in a presentation to the Unicode Technical Committee, Dr. 
Om Vikas of the Department of Information Technology, Government of India noted the historical impor- 
tance of Maithili (Tirhuta) and expressed the Department's interest in establishing a Unicode standard for 
the script.? This development builds upon the activities of Maithili speakers in India and Nepal, who, in the 
past decade, have produced digitized Maithili fonts? and published dictionaries, books of poetry, novels in 
the Maithili script and language in print and digital media. Electronic journals on Maithili literature and cul- 
ture are published in the Maithili script,^ and there is an effort to develop resources such as Wikipedia in the 
Maithili language and script.? An encoding for Maithili in the UCS will support current activity and research 
on Maithili language and script, and will facilitate the development of new resources and technologies for 
Maithili. 


! Salomon 1998: 41. ? Vikas 2005: 31. ? eCube Solutions & Research 2003. “ Thakur 2009. Wikimedia 2008. 
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2 The Issue of Representing Maithili in Unicode 


Maithili is a writing system that is associated with a distinct linguistic community and a historically important 
Sanskritic scholarly tradition. It is a script that despite being nearly obsolete, has been preserved and has 
received renewed attention through revival efforts. An encoding for Maithili, therefore, may be viewed by its 
users as a means for preserving not only the script and the materials written in it, but also as the preservation 
of a cultural identity associated with the Maithili language and script. While such a perspective is certainly 
positive, the preservation of linguistic identity is not a directive of the UCS. The representation of scripts in 
the UCS is driven not by socio-cultural factors, but by the typological features and technical requirements 
of a writing system and by the design principles of character encoding. 


The primary issue facing the representation of Maithili in the UCS is its close affinity to the Bengali script. 
Maithili has several characters that are similar or identical to those already encoded for Bengali. However, 
the two scripts also differ in terms of character repertoires, glyph shapes, orthographic features, and rendering 
behaviors. The correspondences between the characters of Maithili and Bengali requires that a method for 
representing Maithili be established, which meets the requirements of the user community, while adhering 
to the design principles that govern the structure of Unicode. 


With regard to the encoding of Maithili, the most salient of the Unicode design principles is ‘plain text’. 
Plain text is the representation of character content that is independent of formatting. In other words, the 
display of Maithili in plain text means that the characters and orthographic features of the script are managed 
at the character level, not through font changes or alterations to text style. The user requirement to represent 
Maithili in plain text adheres to the structural principles of Unicode; however, the script must also conform 
to another design principle, namely, ‘unification’. Unification is the principle of preventing the duplicate 
encoding of characters within scripts across languages. According to this principle, while Maithili is distinct 
from Bengali in several respects, it is considered a candidate for unification with Bengali because of the 
number of correspondences between the two scripts. 


Taking into consideration the principles of ‘plain text’ and ‘unification’, there are three models for repre- 
senting Maithili in the UCS: 


1. Encode Maithili as an independent script 
2. Encode Maithili as a subset of Bengali 
3. Completely unify Maithili with Bengali 


3 Assessment of the Models for Representing Maithili 


Based upon the number of Maithili characters that correspondence with those of Bengali, it might be argued 
that Maithili is a variant of Bengali, and is therefore a candidate for unification with Bengali. This position 
suggests that Maithili should be accommodated in the UCS as a subset of Bengali either by (a) managing it 
at the presentation level through fonts, or (b) encoding Maithili-specific characters as extensions to Bengali. 
Both of These approaches are problematic: 


* Complete unification of Maithili with Bengali Unifying Maithili and Bengali is synonymous with 
defining Maithili as a variant of Bengali and establishing no standard for the script. Complete unifi- 
cation mandates that the representation and display of Maithili be bound to font control, ie. Maithili 
glyphs will be designed and assigned to the code points of corresponding Bengali characters in Bengali 
fonts. Thus, when users create or view Maithili content, the text will be represented in Bengali script; 
the user must chose a Bengali font designed to accommodate Maithili glyph in order to display Maithili 


Towards an Encoding for the Maithili Script т ISO/IEC 10646 Anshuman Pandey 


properly. Complete unification of Maithili with Bengali does not provide a means for representing the 
script in plain text. 


e Encoding Maithili as subset of Bengali The Bengali script is used for languages other than Bengali, 
such as Assamese and Manipuri. With the exception of two characters, Assamese orthography is iden- 
tical to that for Bengali. These two characters were added to the Bengali block in order to accommodate 
Assamese orthography in the UCS. Although these characters could be treated as glyphic variants of 
existing Bengali characters, a decision was made to encode them separately. The Assamese case of- 
fers a precedent for encoding scripts with large correspondences with an already-encoded script. The 
correspondences between Maithili and Bengali are much less than between Assamese and Bengali. If 
Assamese characters were added as atomic characters, then similarly, Maithili must at minimum be 
represented in the UCS as an extension to Bengali. This will require that at least 40 Maithili-specific 
characters be added for Bengali. Managing Maithili as an extension to Bengali will entail not only 
the addition of a large number of characters to an existing script, but require the development of new 
rendering rules for an already complex script. 


Aside from the technical difficulties related to unification of complex scripts, an equally troublesome issue 
with unification resides in the definition of the unification principle itself. The principle prevents against the 
encoding of duplicate characters within scripts across languages. Although Maithili and Bengali possess 
corresponding characters, unification of the two suggests that they are one and the same script. Unifying 
characters used in the orthographies of languages based upon the Latin script is justifiable; however, despite 
the correspondences between Latin and Cyrillic, characters in the two scripts that have identical semantic 
and glyphical value are encoded independently. The rationale is that unification operates within scripts, not 
across them. The Maithili language is not associated with the Bengali script, and the Maithili script is not 
considered a variant of the Bengali script. Therefore, to encode Maithili by unifying it with Bengali would 
be to unify characters across scripts, which contradicts Unicode principles. 


Presented below are technical considerations regarding unification, which 1$ followed by a discussion of 
other factors, such as user perception, that affect the method of representing Maithili as a script unified with 
Bengali. 


4 Technical Considerations 


4.4 Character Repertoire 


An analysis of the Maithili character repertoire indicates that it has several characters that are identical to 
Bengali, but it also has several character that are distinct from analogues in Bengali. Of the 85 characters 
proposed for Maithili (see tables 6 and 7), 39 characters are contrastive in some aspect from their Bengali 
analogues, while 6 characters are not found in Bengali (see tables 1, 2, and 3): 


* Vowel letters: A, AA, I, II, U, UU, VOCALIC R, VOCALIC RR, VOCALIC L, VOCALIC LL. 

* Vowel signs: II, U, UU, VOCALIC R, VOCALIC RR, VOCALIC L, VOCALIC LL, AI. 

* Consonants: NGA, CHA, JHA, TTA, DDHA, RHA, NAA, PHA, BA, BHA, RA, LA, SHA, HA. 

* Digits: THREE, FOUR, FIVE, SEVEN, EIGHT, NINE. 

* Various signs: ANUSVARA. 

* No correspondence: VOWEL SIGN SHORT E, LLA, VA, АМЛ, OM, GVANG, ABBREVIATION SIGN. 


Nearly all Maithili vowel letters contrast with Bengali characters. Only the independent forms of Е, АТ, О, 
and AU are identical. The majority of vowel signs are contrastive between the scripts. Maithili has signs 
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for SHORT E and SHORT о, which are not found in Bengali, although the latter corresponds glyphically to ct 
U+09D7 BENGALI AU LENGTH MARK, despite the name. 


Although the correspondences between consonant letters are greater, Maithili has characters with no Ben- 
gali analogues and there are important distinctions between corresponding characters. Maithili has distinct 
characters for Я BA and Я va, while Bengali does not have va. 


4.2 Size of Character Repertoire 


Unifying Maithili with Bengali is a more complex task than unifying Assamese with Bengali. The latter 
required the encoding of only two characters: Ч U+09FO BENGALI LETTER RA WITH MIDDLE DIAGONAL 
and 4 U+09F1 BENGALI LETTER RA WITH LOWER DIAGONAL. Such accommodations are acceptable when 
the difference between two scripts is a few characters. Representing Maithili as a subset of Bengali would 
require encoding at least 40 characters as extensions to Bengali. 


4.3 Homoglyphic Characters 


A unified encoding for Maithili with Bengali would prevent the encoding of duplicate characters, but it would 
not prevent the encoding of homoglyphic characters within a single script block. Maithili and Bengali have 
characters that are semantically contrastive, but glyphically identical, eg. Maithili 4 ка and Bengali 4 BA, 
Maithili Я va and Bengali Я кА. Other glyphical correspondences are not identical, but still similar: Maithili 
q LA and Bengali $ NNa, Maithili *T мма and Bengali 1 LA. This issue affects not only characters, but also 
character elements. The element > is a contextual form of VOWEL SIGN U in Maithili, but a conjoining form 
of BA in Bengali. Without appropriate linguistic context, the semantic value of the form Ч is ambiguous; it 
can represent either Maithili sa + VOWEL SIGN U or Bengali SA + VIRAMA + BA. 


4.4 Variant Characters 


Maithili has variant forms of characters and conjuncts. For instance, the letter 4 RA has two forms: (a) X and 
(b) 4. These variants produce consonant-vowel ligatures: (a) * and я, and (b) + and «i. An example of a 
variant conjunct form is that of JA + VIRAMA + NYA, which is represented by both 8 and &. Some character 
elements have variant forms, eg. the conjunct-onset form of ТА is represented as с> and ©. All variant 
forms for Maithili would be managed at the presentation level, as is done for other Indic scripts. 


4.5 Rendering Requirements 
4.5.1 Representation of Corresponding Characters 


Although several Maithili characters have glyph shapes identical to Bengali characters, the manner of rep- 
resenting combinations of corresponding characters often differ in the two scripts. 


* The letter cA has the shape $ in both Maithili and Bengali, but the sequence CA + VIRAMA + CA is 
written in Maithili as & and as ® in Bengali. 


* The letter TA has the shape © in both scripts. The sequence TA + VIRAMA + YA is written in Maithili as 
@ and in Bengali as Vj. 


* The letter Ў MA is identical in both scripts, but Maithili has a specific shaping requirement for MA + 
VIRAMA at a word boundary, where the sequence is represented as 2 not as Ў. 
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Another issue arises with the possibility of character sequences consisting of a mixture of Maithili- and 
Bengali-specific characters: 


* NGA + VIRAMA + GA is represented in Maithili as *Xj and in Bengali as %. The letter *?! GA is common 
between Maithili and Bengali, but NGA is © in Maithili and € in Bengali. 


* RA + VIRAMA + GA, which is represented in Maithili as the ligature Ж, while in Bengali this combination 
is rendering using repha as 4. 


Given the requirement to represent Maithili in plain-text, such rendering behaviors cannot be considered a 
matter of font control or through application of language-specific rules. If Maithili and Bengali are unified, 
it will be necessary to establish methods of rendering sequences of corresponding characters that are repre- 
sented differently in the two scripts, as well as sequences of mixed characters. Such rendering requirements 
could potentially be met through the use of control characters such as zws. However, this would conflict 
with existing rules for producing conjuncts. Moreover, such solutions impose an additional burden upon 
users. 


4.5.2 Contextual Behaviors of Vowel Signs 


e 


Contextual forms of VOWEL SIGN О Тһе vowel sign U ¢ has the contextual form ©, which is used 
with certain consonants and is written with these letters as a ligature. 


gu ju nu du nu pu mu lu su su 


Ман: A @® * Яя 
mes CEL TELAT TSF 


In Bengali, this element represents ba-phala and is used only in the creation of conjuncts. For example, 
Е represents su in Maithili, but sva in Bengali; in Maithili sva is written as Ж. 


° 


Ligatures formed with VOWEL SIGNU Special ligatures are used to write certain consonant-vowel 
combinations with v: 


ku tu dhu bhu yu yu ru su hu 


мш ® © & FAJFA 
Basi FEY Ww y y 93-9 
Ligature forms of VOWEL SIGN UU The form of the sign for the vowel UU is 2. Most consonant- 


vowel combinations involving Uu are written with the dependent vowel form, except for a few that 
are written as special ligatures: 


e 


ke. Ghat ha 
манын A A «m Mm 
Bengali Ф q 4 $ 


Ligature forms of VOWEL SIGN VOCALICR Тһе dependent sign for VOCALICR is 2. Certain consonant- 
vowel combinations with VOCALIC В are written as ligatures: 


e 
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ky ү br bhr hr 
Maithili 4 $ За 
Bengali + Ф Я Ф = 


4.5.3 Contextual Behaviors of Consonants 


Some Maithili consonants change shape in certain conditions. 


Maithili uses phala or primitive forms for producing consonant conjuncts with particular consonants. Both 
Maithili and Bengali use phala forms of YA and RA, when these consonants are the final element of a con- 
sonant conjunct. The phala form of 3 ҮА 15 £3; that of вА is 2. However, in Maithili, phala forms are also 
used for NA, LA, and vA when writing conjuncts. This practice is maintained in Oriya, another script derived 
from Proto-Bengali. 


LETTER TA There are two contextual forms of © TA: 


1. Conjunct onset When TA is the first consonant т a cluster its glyph shape is determined by the second 
consonant. If the second consonant is TA, YA, RA, or VA, the conjunct is written as a ligature. With all 
other consonants, TA is represented as the two-part form (©з. The explicit VIRAMA is written beneath 
the second consonant, eg. saratkdla is written in Maithili as 74С%4 and in Bengali аз 714%. А 
variant form attested in recent printed materials is written as ©, eg. 746%. This behavior of ТА is 
similar to Bengali € KHANDA TA; however, unlike KHANDA ТА, it is unnecessary to encode 02 as an 


atomic character because the contexts for its use are predictable. 


2. Word-final When TA + VIRAMA occurs in a word-final position it is written as ® not as ©. 


LETTER NA Some consonant conjuncts involving * NA are produced using the phala form 2 Of NA: eg 
khna 4 and phna $ аге two examples. 


LETTERBA  Theletter 4 BA has a contextual form when it appears in conjuncts and certain consonant-vowel 
combinations: 


1. Consonant-vowel combinations The letter BA take the conjunct contextual form when it is written 
with VOWEL SIGN VOCALIC R. This combination is written not as *$, but as &. 


2. Conjunct onset When BA appears as the first element of a conjunct with certain consonants it takes 
the shape P, eg. bda X. 


3. Conjunct final When BA appears as the final element of a conjunct it is represented by $ ba-phala, 
eg. mba Я, cf. ти 4 and туа Ў. 


LETTER MA When МА + VIRAMA occurs at a word-final position, the shape of ¥ changes. The combination 
MA + VIRAMA is written as 4 not as X. 


LETTER YA [n consonant clusters where YA is the second element, it takes the phala form. The use of ya 
phala in Maithili is identical to that in Bengali. However, in Maithili the ya phala joins with consonants to 
form a distinct ligature. This feature was historically present in Bengali, but it is rare in modern orthography. 
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tya dya nya bhya 5уа sya hya 
мәш © J 9 J YJ :g ЫЫ 
Bengali vj Wj a V ху HS $j 


LETTER RA Maithili repha is similar semantically to its analogue in Bengali, but it is graphically distinct. 
. . .1° c H . 

It is written above the following consonant letter: rpa in Maithili *Ї and in Bengali %. In some cases, repha 

merges with the consonant letter with which it is written to create a distinct ligature: rga in Maithili Ж and 

Bengali *f. 


LETTER LA Some consonant conjuncts involving LA are produced using the G phala form of LA. Some 


examples are kla $ (Bengali $), phla a (Bengali 35), ‘sla m (Bengali *)), 


LETTER VA Some consonant conjuncts involving vA are produced using the phala form of УА: va is rep- 
resented by the = va-phala. 
4.5.4 Conjunct Forms 


The Maithili and Bengali scripts have retained a more conservative position on the use of conjuncts. Even 
so, there are efforts to simplify the writing of conjuncts in Bengali, as has been done for Devanagari in the 
orthographies for Hindi and other modern languages. Maithili displays a tendency towards the retention of 
complex conjunct forms. This may change through increased use ofthe script in digital media. А comparison 
of conjunct forms in Maithili and Bengali is given in Table 5. 


S User Perception 


In addition to the technical considerations that guide a determination on whether or not to unify Maithili and 
Bengali, factors such as user perception of script distinctions and the support and development of fonts and 
a character-encoding standard must be also considered. 


5.1 User Perception of Script Distinctiveness 


User perception of a script and its distinctiveness is an important issue to consider when unifying scripts. 
When the printing of Maithili books was begun in the 1920s, metal fonts for Maithili were developed in 
Calcutta by modifying existing Bengali fonts. Bengali fonts were widely available and the characters were 
similar enough to Maithili, so that the cost of developing a set of Maithili fonts from existing sources was 
lesser than producing a set of fonts anew. Similarly, when Maithili digital fonts were developed in Nepal in 
2003, the corresponding forms in Bengali fonts were replaced by newly drawn Maithili-specific characters. 
If Maithili were perceived by its users as a variant of Bengali, printers and publishers would have simply 
produced Maithili books using Bengali metal and digital fonts. The distinction between the two scripts is 
also understood in Bengal, where Maithili is known as firufe, or ‘belonging to the Tirhuta region’.® 


The disunification of Greek and Coptic rested upon user perceptions of the distinctions between scripts. 
Michael Everson and Kamal Mansour advocated the disunification of Coptic and Greek in N2444 (2002) 
on the principle that “unification of scripts should imply that readers of a language are able to make sense, 
with relative ease, of a text written in a variant of the script." This rationale may be applied to the case for 


$ Chatterji 1926: 225. ^ Everson and Mansour 2002. 
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disunifying Maithili and Bengali. Consider the following: the Bengali term for the Bengali language and 
script, 411, is transliterated into Maithili as fal. While the style of the two words are similar and certain 
character elements are identical, the representations of the word bamia are entirely different. The difference 
is not one of style, but of character identity. A speaker of Bengali would not immediately recognize the 
name of his language as žial. The perceived differences between Maithili and Bengali suggests that despite 
the similarities, readers of Maithili and Bengali view their scripts not as variations upon a common model, 
but distinct writing systems belonging to distinct languages and linguistic communities. 


5.2 Support and Development 


User perception is critical for a character-encoding standard for any given script because the effectiveness 
of a standard is dependent upon its acceptance by the target user community. User perception regarding a 
script and the characters that constitute it is essential not only for the adoption of an encoding standard for 
a writing system, but also for continued support and development of fonts and other implementations. The 
unification of Maithili and Bengali would require that at least 40 Maithili-specific characters be added as 
extensions to the Bengali script. Such a unification binds the support and development of Maithili to the 
motivations and perceptions of Bengali font developers. 


Even if Maithili were added as a separate block named “Bengali Extensions", the support for such a block 
is dependent upon Bengali and implementers of Bengali. Extended characters are not always viewed pos- 
itively. An important example is Meetei Mayek. The Government of Manipur, India rejected a proposal 
for Meetei Mayek because it disagreed with the presence of historical characters in the code block, which 
are not considered part of the official modern orthography for the script.’ The characters in question were 
removed from the proposal and, ultimately, a proposal for adding an extended block of historical Meetei 
Mayek characters was also tabled. While political considerations should not guide character encoding, the 
standard for Meetei Mayek reflects the requirements of its users; the Government of Manipur being a key 
stakeholder. A standard encoding for Meetei Mayek containing characters, even those termed as extended 
characters, that are not perceived as part of the script by users was likely to be unadopted and unsupported. 


With specific reference to Bengali, questions have been raised regarding the presence of historical characters 
in the Bengali block and the necessity of maintaining them. Most recently, Bidyut Baran Chaudhuri of the 
Society for Natural Language Technology Research (Kolkata, India) suggested reformation of the Bengali 
code block. He stated that *[m]any code points are unnecessarily filled by some old (historical) character 
or modifier signs."? The reference is to the vocalic vowel letters. Chaudhari suggested that these historical 
characters be placed in the Private Use Area so that code points may be made available for other charac- 
ters, presumably those used in representing modern Bengali. If native Bengali users do not perceive the 
importance of maintaining historical Bengali characters, then presumptively there will be equal hesitation to 
accommodate and support Maithili characters, which are not used in Bengali. 


6 Conclusions 


The most appropriate method of representing Maithili in the UCS is to encode it as an independent script. 
While this approach would contradict the principle of unification, it would facilitate the representation of 
Maithili in plain text. An independent encoding for Maithili in the UCS will benefit the Maithili-speaking 
community by enabling it to adapt its traditional script for general communication and literary development 
through the digital medium. A standard encoding for the script will also enable scholars engaged in research 
on Maithili language and literature to preserve and reproduce Maithili manuscripts through digital technolo- 
gies. Moreover, organizations in South Asia are actively developing fonts and standards for representing 


* India. Government of Manipur 2008. ° Chaudhuri 2009. 
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the Maithili script. A standard encoding in Unicode would benefit technical groups that are implementing 
the Maithili language and script for use in information technology. Although Maithili is a historical script, 
the Maithili-speaking community has preserved it for traditional purposes and has recently revived its use 
through digital publishing. An encoding for Maithili in the UCS will enable users of the language and script 
to meet the needs of modern communication and computing technologies. 
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MAITHILI BENGALI MAITHILI BENGALI 
A $ $ no dependent form 
AA SJ XT e 1 Ji 
| я 3 f В 
© $ © 2 * 
UU $ © eu L 
VOCALIC R N a S C 
VOCALIC RR E 4 2 E 
VOCALIC L J D ч ES 
VOCALIC LL © % a A 
E 4 Ч G2 Ge 
= 
SHORT Е no independent form р = 


Al & & Gs без 
о S 8 G GA 


SHORT O no independent form ct : 3l 


AU © © CÀ CÀ 


Table 1: Comparison of Maithili and Bengali vowel letters and signs (differences highlighted). 
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KA 


KHA 


GA 


GHA 


NGA 


CA 


CHA 


JA 


JHA 


NYA 


TTHA 


DDDA 


DDHA 


RHA 


NNA 


TA 


THA 


MAITHILI 


ZA 3 AAA d er ce ДА S e «v ж дл YE 24 


BENGALI 


Aq 9 чач аа e^ ef f5 AGMA dé д ок 2 


DA 


DHA 


NA 


PA 


PHA 


BA 


BHA 


MA 


YA 


YYA 


RA 


LA 


VA 


SHA 


SA 


HA 


MAITHILI 


я 


3 2] Ж 


де 5 


л S д м о 9 A м м d 
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BENGALI 


5] 


А Mo» > д 


3| A M д ом d 


Table 2: Comparison of Maithili and Bengali consonant letters (differences highlighted). 
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MAITHILI BENGALI 


ZERO O O 
ow 9 N 
mo 2 9 
mee 3 © 
юж 8 8 
we A € 
x © v 
seven © а 
көт + v 
мм @ Ф 


Table 3: Comparison of Maithili and Bengali digits (differences highlighted). 


MAITHILI BENGALI 


NJ NJ 
CANDRABINDU S. Nue 
о 

ры 25,0 

ANUSVARA Loe RIN 

МО 24,0 

VISARGA Lito i26 
AVAGRAHA » » 
ANJI f — 
OM X3» 320 
GVANG W == 


Table 4: Comparison of Maithili and Bengali miscellaneous signs (differences highlighted). 
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MAITHILI BENGALI MAITHILI BENGALI 


S © £& cwm ак sk 2M xp x 23 al o8 


J 
aa & FAD S 
A э foi AA A AA c c ду A jd A Aad A d 


осо dj ey av GM ая f osboxx xp om S & S $6 
e а emssgs 9s € 5» 3 5 9 


Table 5: Comparison of conjunct forms used in Maithili and Bengali 
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1148 1149 114A 114B 114C 114D 


114CA 


KR 


NNN 


ММ 


КУЭМ 
SSSSSSSSSSSSSSS 


ААА 
ААА 


NSSSSNSNNNNS 
АХ 


ААА 


Table 6: Proposed code chart for Maithili 
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Various signs 


11480 MAITHILI SIGN CANDRABINDU 
11481 MAITHILI SIGN ANUSVARA 
11482 с: MAITHILI SIGN VISARGA 
Independent vowels 

11483 2 MAITHILI LETTER A 

11484 ЭЙ MAITHILI LETTER AA 
11485 @ MAITHILI LETTER I 

11486 @ MAITHILI LETTER II 

11487 © MAITHILI LETTER U 

11488 $ MAITHILI LETTER UU 
11489 & MAITHILI LETTER VOCALIC В 
1148A & MAITHILI LETTER VOCALIC RR 
1488 J MAITHILI LETTER VOCALIC L 
1148C Я MAITHILI LETTER VOCALIC LL 
11480 4 MAITHILI LETTER E 

1148E a MAITHILI LETTER AI 

1148F S$ MAITHILI LETTER О 

11490 8 MAITHILI LETTER AU 
Consonants 

11491 $ MAITHILI LETTER KA 
11492 4 MAITHILI LETTER KHA 
11493 Я MAITHILI LETTER GA 
11494 Ч MAITHILI LETTER GHA 
11495 © MAITHILI LETTER NGA 
11496 Б MAITHILI LETTER CA 
11497 & MAITHILI LETTER CHA 
11498 © MAITHILI LETTER JA 

11499 W MAITHILI LETTER JHA 
1149A < MAITHILI LETTER NYA 
1149B 3 MAITHILI LETTER TTA 
1149C 5 MAITHILI LETTER ТТНА 
1149D © MAITHILI LETTER DDA 
1149E © MAITHILI LETTER DDDA 
1149F © MAITHILI LETTER DDHA 
11440 © MAITHILI LETTER RHA 
114A1 т MAITHILI LETTER ММА 
114A2 5 MAITHILI LETTER TA 
11443 Ч MAITHILI LETTER THA 
11444 и MAITHILI LETTER DA 
114A5 4 MAITHILI LETTER DHA 
11446 MAITHILI LETTER NA 
114А7 Я MAITHILI LETTER PA 
11448 % MAITHILI LETTER PHA 
11449 4 MAITHILI LETTER BA 
114AA © MAITHILI LETTER BHA 
114AB * MAITHILI LETTER MA 
114AC < MAITHILI LETTER YA 
114AD < MAITHILI LETTER YYA 
114AE 4 MAITHILI LETTER RA 
114AF 4 MAITHILI LETTER LA 
11480 4 MAITHILI LETTER LLA 
11481 4 MAITHILI LETTER VA 
1148? *| MAITHILI LETTER SHA 
11483 Я MAITHILI LETTER SSA 
114B4 Я MAITHILI LETTER SA 
11485 © MAITHILI LETTER HA 


Dependent vowel signs 


11486 ct 
11487 Р 
11488 5 
114B9 
114BA 
114BB 
114BC 
114BD 


MAITHILI VOWEL SIGN AA 
MAITHILI VOWEL SIGNI 


Я MAITHILI VOWEL SIGN II 


MAITHILI VOWEL SIGN U 
MAITHILI VOWEL SIGN UU 


MAITHILI VOWEL SIGN VOCALIC R 
MAITHILI VOWEL SIGN VOCALIC RR 
MAITHILI VOWEL SIGN VOCALIC L 


ШАВЕ © 
1146F со 
11400 5 
114С1 © 
114C2 cot 
14C3 Я 
114CA C3 
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MAITHILI VOWEL SIGN VOCALIC LL 
MAITHILI VOWEL SIGN E 

MAITHILI VOWEL SIGN SHORT E 
MAITHILI VOWEL SIGN AI 
MAITHILI VOWEL SIGN O 

MAITHILI VOWEL SIGN SHORT O 
MAITHILI VOWEL SIGN AU 


Various signs 


114C5 
11406 
114С7 
114C8 
114C9 
114CA 
Digits 
11400 
11401 
11402 
11403 
11404 
11405 
11406 
11407 
11408 
11409 


= (NS o RO 


2+ э е ©с мш уду го 


MAITHILI SIGN VIRAMA 
MAITHILI SIGN AVAGRAHA 
MAITHILI ABBREVIATION SIGN 
MAITHILI ANJI 

MAITHILI OM 

MAITHILI GVANG 


MAITHILI DIGIT ZERO 
MAITHILI DIGIT ONE 
MAITHILI DIGIT TWO 
MAITHILI DIGIT THREE 
MAITHILI DIGIT FOUR 
MAITHILI DIGIT FIVE 
MAITHILI DIGIT SIX 
MAITHILI DIGIT SEVEN 
MAITHILI DIGIT EIGHT 
MAITHILI DIGIT NINE 


Table 7: Proposed names list for Maithili 


